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Abstract 

Many, if not most network analysis algorithms have been designed specifi- 
cally for single-relational networks; that is, networks in which all edges are 
q \ of the same type. For example, edges may either represent "friendship," 

"kinship," or "collaboration," but not all of them together. In contrast, a 
CN ■ multi-relational network is a network with a heterogeneous set of edge labels 

^ ■ which can represent relationships of various types in a single data structure. 

r~- While multi-relational networks are more expressive in terms of the variety 

of relationships they can capture, there is a need for a general framework for 
transferring the many single-relational network analysis algorithms to the 
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multi-relational domain. It is not sufficient to execute a single-relational net- 
work analysis algorithm on a multi-relational network by simply ignoring edge 
labels. This article presents an algebra for mapping multi-relational networks 
to single-relational networks, thereby exposing them to single-relational net- 

^ \ work analysis algorithms, 

hi . 
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1. Introduction 



Much of graph and network theory is devoted to understanding and an- 
alyzing single-relational networks (also known as directed or undirected un- 
labeled graphs). A single- relational network is composed of a set of vertices 
(i.e. nodes) connected by a set of edges (i.e. links) which represent relation- 
ships of a single type. Such networks are generally defined as G = (V,E), 
where V is the set of vertices in the network, E is the set of edges in the net- 
work, and E C (V x V). For example, if i E V and j E V, then the ordered 
pair E E represents an edge from vertex i to vertex j. 2 Ignoring the 

differences in the vertices that edges connect, all edges in E share a single 
nominal, or categorical, meaning. For example, the meaning of the edges in 
a single-relational social network may be kinship, friendship, or collabora- 
tion, but not all of them together in the same representation as there is no 
way distinguish what the edges denote. Moreover, single-relational networks 
may be weighted, where w : E — > R is a function that maps each edge in 
E to a real value. Weighted forms are still considered single-relational as all 
the edges in E have the same meaning; the only difference is the "degree of 
meaning" defined by w. 

The network, as a data structure, can be used to model many real and 
artificial systems. However, because a network representation of a system 
can be too complicated to understand directly, many algorithms have been 
developed to map the network to a lower dimensional space. For a fine review 
of the many popular network analysis algorithms in use today, refer to [1, 2]. 
Examples of such algorithms, to name a few of the more popularly used 
algorithms, include the the family of geodesic [3, 4, 5, 6], spectral [7, 8, 9], 
and community detection algorithms [10, 11, 12]. 

Most network analysis algorithms have been developed for single-relational 
networks as opposed to multi-relational networks. A multi-relational network 
is composed of two or more sets of edges between a set of vertices. A multi- 
relational network can be defined as M = (V, E), where V is the set of vertices 
in the network, E = {Ei, E 2 , . . . , E m } is a family of edge sets in the network, 
and any Ek C (V x V) : 1 < k < m. Each edge set in E has a particular nom- 
inal, or categorical, interpretation. For example, within the same network 



2 This article is primarily concerned with directed networks, as opposed to undirected 
networks, in which an edge is an unordered set of two vertices (e.g. {i,j}). However, the 
formalisms presented work with undirected networks. 
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M, E 1 ,E 2 G E may denote "kinship" and "co-authorship," respectively. 

The multi-relational network is not new. These structures have been used 
in various disciplines ranging from cognitive science and artificial intelligence 
[13] (e.g. semantic networks for knowledge representation and reasoning) to 
social [1] and scholarly [14, 15] modeling. Furthermore, the multi-relational 
network is the foundational data structure of the emerging Web of Data 
[16, 17]. While a multi-relational network can be used to represent more 
complicated systems than a single-relational network, unfortunately, there 
are many fewer multi-relational algorithms than single-relational algorithms. 
Moreover, there are many more software packages and toolkits to analyze 
single-relational networks. Multi-relational network analysis algorithms that 
do exist include random walk [18], unique path discovery [19], community 
identification [20], vertex ranking [21], and path ranking algorithms [22]. 

The inclusion of multiple relationship types between vertices complicates 
the design of network algorithms. In the single-relational world, with all 
edges being "equal," the executing algorithm need not be concerned with 
the meaning of the edge, but only with the existence of an edge. Multi- 
relational network algorithms, on the other hand, must take this information 
into account in order to obtain meaningful results. For example, if a multi- 
relational network contains two edge sets, one denoting kinship (E\ e E) 
and the other denoting coauthorship (E 2 £ E), then for the purposes of a 
scholarly centrality algorithm, kinship edges should be ignored. In this sim- 
ple case, the centrality algorithm can be executed on the single-relational 
network defined by G = (V, E 2 ). However, there may exist more complicated 
semantics that can only be expressed through path combinations and other 
operations. Thus, isolating single-relational network components of M is not 
sufficient. As a remedy to this situation, this article presents an algebra for 
defining abstract paths through a multi-relational network in order to de- 
rive a single-relational network representing vertex connectivity according to 
such path descriptions. From this single-relational representation, all of the 
known single-relational network algorithms can be applied to yield mean- 
ingful results. Thus, the presented algebra provides a means of exposing 
multi-relational networks to single-relational network analysis algorithms. 

2. Path Algebra Overview 

This section provides and overview of the various constructs of the path 
algebra and primarily serves as a consolidated reference. The following sec- 
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tions articulate the use of the constructs summarized here. 

The purpose of the presented algebra is to transform a multi-relational 
network into a "semantically-rich" single-relational network. This is ac- 
complished by manipulating a three-way tensor representation of a multi- 
relational network. The result of the three-way tensor manipulation yields 
an adjacency matrix (i.e. a two-way tensor). 3 The resultant adjacency matrix 
represents a "semantically-rich" single-relational network. The "semantically- 
rich" aspect of the resultant adjacency matrix (i.e. the single-relational net- 
work) is determined by the algebraic path description used to manipulate the 
original three-way tensor (i.e. the multi-relational network). More formally, 
the multi-relational path algebra is an algebraic structure that operates on 
n x n adjacency matrix "slices" ofanxnxm three-way tensor representation 
of a multi-relational network in order to generate a n x n path matrix. The 
generated nxn path matrix represents a "semantically-rich" single-relational 
network that can be subjected to any of the known single-relational net- 
work analysis algorithms. The path algebra is a matrix formulation of the 
grammar-based random walker framework originally presented in [18]. How- 
ever, the algebra generates "semantically-rich" single-relational networks, as 
opposed to only executing random walk algorithms in a "semantically-rich" 
manner. In other words, the algebra is cleanly separated from the analysis 
algorithms that are ultimately applied to it. This aspect of the algebra makes 
it generally useful in many network analysis situations. 

The following list itemizes the various elements of the algebra to be dis- 
cussed in §3. 

• A G {0, i} nxriXm : a three-way tensor representation of a multi-relational 
network. 4 

• Z G M™ xn : a path matrix derived by means of operations applied to A. 

• Rj G {0, l} nxri : a row "from" path filter. 

• d G {0, l} nxn : a column "to" path filter. 



3 The term tensor has various meanings in mathematics, physics, and computer science. 
In general, a tensor is a structure which includes and extends the notion of scalar, vector, 
and matrix. A zero-way tensor is a scalar, a one-way tensor is a vector, a two-way tensor 
is a matrix, and a three-way tensor is considered a "cube" of scalars. For a three-way 
tensor, there are three indices used to denote a particular scalar value in the tensor. 

4 This article is primarily concerned with boolean tensors. However, note that the 
presented algebra works with tensors in ]R™ xnxm . 
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• Eij E {0, l}™ xri : an entry path filter. 

• I 6 {0, l} nxn : the identity matrix as a self-loop filter. 

• 1 G l nxn : a matrix in which all entries are 1. 

• G nxn : a matrix in which all entries are 0. 

The following list itemizes the various operations of the algebra to be dis- 
cussed in §4. 

• AB: ordinary matrix multiplication determines the number of (A, B)- 
paths between vertices. 

• A T : matrix transpose inverts path directionality. 

• A o B: Hadamard, entry-wise multiplication applies a filter to selec- 
tively exclude paths. 

• n(A): not generates the complement of a {0, l} nxn matrix. 

• c(A): clip generates a {0, l} nxn matrix from a R" xri matrix. 

• v ± (A): vertex generates a {0, l} nxn matrix from a M" xri matrix, where 
only certain rows or columns contain non-zero values. 

• AA: scalar multiplication weights the entries of a matrix. 

• A + B: matrix addition merges paths. 

In short, any abstract path a is a series of operations on A that can be 
generally defined as 

a : {0, l}" x " xm _, R" x ™. 

The resultant R™ xra matrix is an adjacency matrix. Thus, the resultant ma- 
trix is a single-relational network. This resultant single-relational network 
can be subjected to single-relational network analysis algorithms while still 
preserving the semantics of the original multi-relational network. 

Throughout the remainder of this article, scholarly examples are provided 
in order to illustrate the application of the various elements and operations 
of the path algebra. All of the examples refer to a single scholarly tensor 
denoted A. The following list itemizes the tensor "slices" and their domains 
and ranges, where H C V is the set of all humans, A C V is the set of all 
articles, J C V is the set of all journals, S C V is the set of all subject 
categories, and P C V is the set of all software programs: 

• A 1 : authored : H — > A 
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• A 2 : authoredBy : A — > H 

• A 3 : cites : A —> A 

• A 4 : contains : J — > A 

• „4 5 : category : J — > S 

• A 6 : developed : H —> P. 

3. Path Algebra Elements 

This section introduces the elements of the algebra and the next section 
articulates their use in the various operations of the algebra. The elements 
of the path algebra are structures that are repeatedly used when mapping a 
multi-relational tensor to a single-relational path matrix. 

3.1. Three-way Tensor Representation of a Multi-Relational Network 
A single-relational network defined as 

G = (V,E C (V x V)) 

can be represented as the adjacency matrix A, where 



l X(i,j)eE 

otherwise 



without loss of information. This adjacency matrix is also known as a two- 
way tensor because it has two dimensions, each with an order of n, where 
n = \ V\. Stated another way, A e {0, l} nxn . 

A three-way tensor can be used to represent a multi-relational network 
[23]. If 

M = (V, E = {Ek, E 2 , . . . , E m C (V x V)}) 
is a multi-relational network, then 



1 if (i, j) G E k : 1 < k < m 
otherwise. 



In this formulation, two dimensions have an order of n while the third has 
an order of m, where m — |E|. Thus, A £ {0, l} nxnxm and any adjacency 
matrix "slice" A k G {0, l} riXra : I < k < m. A represents the primary 
structure by which individual adjacency matrices are indexed and composed 
in order to derive a resultant single-relational path matrix. 
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3.2. Path Matrices 

The path algebra operates on n x n adjacency matrix elements of A to 
construct a "semantically-rich" path matrix. The simplest path is made up 
of a single edge type. Thus, A k is a path matrix for vertex-to-vertex paths 
of length 1. The meaning of that path is simply defined by the meaning of 
Efe. When constructing complex paths through A (when utilizing multiple 
edge types in .4.), the resulting matrix may have entries with values greater 
than 1. Furthermore, in conjunction with the use of R + scalar multiplica- 
tion (described in §4.3), entries of a path matrix may be non-negative real 
numbers. Therefore, in general, all path matrices discussed throughout the 
remainder of this article are matrices in M™ xn and are denoted Z. In short, 
the resultant Z matrix can be seen as a positively weighted single-relational 
network. In other words, Z denotes a single-relational network of the form 
G = (V, E, w), where w: E -> R+. 

3.3. Filter Matrices 

Filters are used to ensure that particular paths through A are either in- 
cluded or excluded from a path composition. Generally, a filter is a {0, l} raxra 
matrix. Filters may be derived from intermediate path matrices (described 
in §4.2) or may target a specific vertex that is known a prior. A vertex- 
specific filter is either a row Rj e {0, l} riX ™ 5 column C* G {0, l} nxn ; or entry 
Eij G {0, l} nxn filter. 

1. A row filter is denoted Rj, where all entries in row i are equal to 1 and 
all other entries are equal to 0. Row filters are useful for allowing only 
those paths that have an origin of %. 

2. A column filter is denoted Cj, where all entries in column % are equal to 
1 and other entries are equal to 0. Column filters are useful for allowing 
only those paths that have a destination of i. 

3. Entry filters are denoted Ejj and have a 1 at the (i,j)-entry and 
elsewhere. Entry filters allow only those paths that have an origin of % 
or a destination of j. 

Useful properties of the vertex-specific filters include: 

• R, = CJ 

• Q = RJ 
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The identity matrix I is useful for allowing or excluding self-loops. Finally, 
the filter 1 E l nxn is a matrix in which all entries are equal to 1, and E nxn 
is a matrix in which all entries are equal to 0. 

4. Path Algebra Operations 

The previous section defined the common elements of the path algebra. 
These elements are composed with one another to create a path composition. 
This section discusses the various operations that are used when composing 
the aforementioned elements. 

4-1. The Traverse Operation 

A useful property of the single-relational adjacency matrix A E {0, l} nxn 
is that when it is raised to the t th power, the entry A-'j is equal to the number 
of paths of length t that connect vertex % E V to vertex j E V [24]. This 
is simple to prove using induction. Given, by definition, that A^j* (i.e. Ajj) 
represents the number of paths that go from i to j of length 1 (i.e. a single 
edge) and by the rules of ordinary matrix multiplication, 

AiJ> = y><r» - A„ : * > 2. 

lev 

The same mechanism for finding the number of paths of length t in a 
single-relational network can be used to find the number of semantically 
meaningful paths through a multi-relational network. For example, suppose 
A 1 has the label authored, A 2 has the label authoredBy, A 3 has the label 
cites, and 

Z = A 1 ■ A 3 ■ A 2 . 

Semantically, Zjj is the number of paths from vertex i to vertex j such that 
a path goes from author i to one the articles he or she has authored, from 
that article to one of the articles it cites, and finally, from that cited article 
to its author j. 5 The meaning of Z is hasCited : H — > H and represents an 
edge if some author has cited some other author by means of their respective 
articles (i.e. an author citation network). This method is analogous to raising 



If vertex i is not an author, then such a path composition would yield paths from i 
to any j. A path composition must respect the domains and ranges of the edge types if a 
meaningful path matrix is to be generated. 
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an adjacency matrix to the t th power, except that, in place of multiplying the 
same adjacency matrix with itself, a sequence of different adjacency matrix 
"slices" in A are used to represent typed paths with a compositionally defined 
meaning. 

It is worth noting that ordinary matrix multiplication is not commutative. 
Thus, for the most part, when A^B, AB^BA. Given paths through a 
multi-relational network, this makes intuitive sense. The path from authored 
to cites is different than the path from cites to authored. In the first 
case, if the resultant path is seen as a mapping, then authored : H — > A 
(i.e. human to article) and cites : A — > A (i.e. article to article). Thus, 
through composition cites o authored : H — > A. 6 However, in the latter 
case, composition is not possible as cites has a range of an article and 
authored has a domain of human. 

Finally, any n x n adjacency matrix element of A can be transposed in 
order to traverse paths in the opposite direction. For example, given Ai and 
A2 identified as authored and authoredBy, respectively, where A 1 = A 2 , 
then Z = A 1 ■ A 3 ■ A 2 = A 1 ■ A 3 ■ A lT . Thus, inverse edge types can be created 
using matrix transpose. 



4-2. The Filter Operation 

In many cases, it is important to exclude particular paths when traversing 
through A. Various path filters can be defined and applied using the entry- 
wise Hadamard matrix product denoted o [25], where 



A o B = 



A11 ■ Bi 1 ■ ■ ■ Ai m ■ Bi 



A , ■ R , ... A • R 

-"-71,1 1 - J n,l -"-ra, m 1J n,m 



The following list itemizes various properties of the Hadamard product: 

• Ao 1 = A 

• A o = 

• AoB = B o A 

• A o (B + C) = (A o B) + (A o C) 



6 The symbol o is overloaded in this article meaning both function composition and the 
Hadamard matrix product. The context of the symbol indicates its meaning. 
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• Ao AB = A(AoB) 

• A T oB T = (A o B) T . 

If A is a {0, l} nxn matrix, then A o A = A. For the row, column, and entry 
filters, 

• Rj o Ft, = : iy^j 

• d oCj = : i^j 

• Rj o Cj = Ejj. 

Finally, if Z e R+ Xn and Z has a trace of (i.e. no self-loops), then Zol = 0. 

The Hadarmard product is used in the path algebra to apply a filter. As 
stated previously, a typical filter is a {0, l} nxn matrix, where entries set 
the corresponding path counts in Z to 0. The following subsections define 
and illustrate some useful functions to generate filters. 

4.2.1. The Not Function 

The not function is defined as 

n : {0,l} nxn -> {0,l} nxn 

with a function rule of 

n(A) = 1 - A. 

In words, the not takes a {0, l} nxri matrix and replaces all the 0s with Is and 
all the Is with 0s. Some evident and useful properties of a not filter include 

• n(n(A)) = A 

• A o n(A) = 

• n(A) o n(A) = n(A). 

Furthermore, if Z e M" xn and Z has a trace of 0, then Z o 77,(1) = Z. 

A not function is useful for excluding a set of paths to or from a vertex. 
For example, when constructing a coauthorship path matrix where A 1 repre- 
sents authored, the operation A 1 ■ A lT o n(I) will ensure that the authored 
relationship is taken and then the transpose of authored (i.e. authoredBy) 
is taken. However, if only these two operations are applied, then this does 
not yield a coauthorship matrix, as the traversal returns to the originating 
vertex (i.e. vertex i is considered a coauthor of vertex i). Thus, the applied 
not-identity filter will remove all paths back to the source vertex, at which 
point a coauthorship path matrix is generated. 
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4-2.2. The Clip Function 

The clip function maps an R" x ™ path matrix to a {0, l} nxn matrix. The 
function is defined as 

c: R r l xn -> {0,l} nxn 

with a function rule of 

10 otherwise. 

The general purpose of clip is to take a non-{0, l} nxn path matrix and to 
"clip," or normalize, it to a {0, l} nx ™ matrix. Thus, clip creates a filter that 
can then be applied to a composition to exclude paths. 

If A G {0, l} nxn ; then evident and useful properties are 

• c(A) = A 

• c(n(A)) = n(c(A)) = n(A). 
Proposition 1. //Y,Z e R" xri ; tfaera 

c(YoZ) = c(Y)oc(Z). 

Proof. This property can be demonstrated given an entry-wise representa- 
tion, where 

c (Y;j • Zjj) = c(Yjj) • c(Zjj). 

The equality holds for all cases where both entries are 0, both entries are 
greater than 0, and where one entry is and the other is greater than 0. □ 

Proposition 2. If A,B E {0, l} nxn , then 

n(AoB) = c(n(A) +n(B)) 

Proof. This proposition follows a similar pattern as De Morgan's law for 
boolean values, where ->{P A Q) = ->P V ->Q. However, because matrix 
addition over {0, l} riX ™ matrices has the potential to yield a value of 2 if 
n(Aij) = n(Bjj) = 1, clip will ensure that c(n(A) + c(B))jj = 1. □ 
Likewise, 

Proposition 3. If A,B E {0, l} nxn , then 

n(c(A + B)) = n(A)on(B) 
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Proof. This proposition follows a similar pattern as De Morgan's law for 
boolean values, where -i(PVQ) = ->P A ->Q. □ 
To use a scholarly example, it is possible to exclude all coauthorship 
and self-loop paths from a larger composite. For instance, if, as previously 
demonstrated, 

A 1 ■ A lT o n (I) 

defines a coauthorship path matrix and A 3 denotes cites relations, then 
Z = (A 1 ■ A 3 ■ A lT ^j on (c (A 1 ■ A lT o n (I))) o n (I) 

.J , , no self 

cites no coauthors 

is a hasCited' path matrix where citing one's coauthors and oneself is not 
considered a legal citation path. As previously demonstrated in §4.1, the first 
component (i.e. cites) generates a hasCited path matrix for all authors citing 
each other's articles, where coauthorship and self-citation are legal. The sec- 
ond component (i.e. no coauthors) applies the not function to a path matrix 
generated from a clip of a coauthorship path matrix. This excludes coau- 
thors as being legal author citations. Finally, the third component (i.e. no 
self) disallows self-loops. The application of the two filter components re- 
moves paths that go from an author to his- or herself as well as to his or her 
respective coauthors. 

With the help of the propositions and properties of the various opera- 
tions of the path algebra, the above composition can be simplified. While 
the following simplification is lengthy, it utilizes many of the properties and 
propositions demonstrated hitherto. If X = A 1 ■ A 3 ■ A lT and Y = A 1 ■ A lJ , 
then 



Z = Xon(c(Yon(I)))on(I) 

= Xon(c(Y)oc(n(I))on(I) 

= Xon(c(Y)on(I))on(I) 

= Xoc(n(c(Y))+n(n(I)))on(I) 

= X o c(n(c(Y)) + I) o n(I) 

= Xoc(n(c(Y))+I)oc(n(I)) 

= Xoc(n(c(Y)) + Ion(I))) 

= Xoc(n(c(Y))on(I)+Ion(I))) 

= Xoc(n(c(Y))on(I)) 

= Xoc(n(c(Y)))oc(n(I)) 

= Xon(c(Y))on(I) 



prop. 1 
c(n(A)) = A 
prop. 2 
n(n(A)) = A 
c(n(A)) = n(A) 
prop. 1 

A o (B + C) = (A o B) + (A o C) 
A o n(A) = 
prop. 1 

c(n(A)) = n(A). 
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Thus, 

Z = (A 1 ■ A 3 ■ A lT ^j o n (c (A 1 ■ A lT )) o n(I). 

In words, only a single filter disallowing self-loops is necessary to yield the 
same result. 

4-2.3. The Vertex Functions 

In many cases, it is important to filter out particular paths from and to 
a vertex. Two useful functions are v~ and v + , where 

v~ : W r _l xn xN^{0,l} nxn , 



otherwise 



turns a complete row into an all 1-row if the sum of row entries is greater 
than p and 

D + :K; xn xN^{0,l}" x ", 



otherwise 



turns a complete column into an all 1-column if the sum of the column 
entries is greater than p. The function v~ is used to select paths outgoing 
from particular vertices and v + is used to select paths incoming to particular 
vertices. Moreover, by providing p, it excludes those vertices with less than 
p paths outgoing from or incoming to it. For the sake of brevity, when no p 
is supplied, it is assumed that p — 0. 

Some useful properties of the vertex filter are 

• u-(Ri) = R, 

• v + (C t ) = d 

• v + (E id )ov-(E iJ )=-E iJ 

• «-(ZoRi) = u-(Z)oRi 

• u+(ZoCi) =u+(Z)oCi 
. v~(Z,p)=v + (Z T ,p) T 

. v + (Z,p)=v-(Z T ,p) T . 
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To demonstrate the use of the vertex function, consider the multi-relational 
tensor A that includes journals, articles, and subject categories, where A 3 
denotes cites : A — > A, A 4 denotes contains : J — > A, A 5 denotes 
category : J — > S, and vertex 1 denotes the subject category "social sci- 
ence." A social science journal citation matrix can be created from A in which 
a path exists between journals if and only if an article contained in a journal 
cites an article contained in another journal. Furthermore, only those citing 
and cited articles are considered that are in social science journals. Thus, 
the social science journal citation path matrix is defined as 

Z = Ur (Ci o A 5 ) o A 4 ] -A 3 ■ [A 4T o v + (Rx o A 5 ^ 



"V™ 



soc.sci. journal articles - m sQc sd journals 

First, a vertex-created filter is applied to remove all articles that are not 
contained in social science journals. Next, the articles that these articles 
cite is determined. Finally, those articles not in social science journals are 
filtered out using another vertex-created filter. Thus, a citation path matrix 
is generated which only includes social science journals. 

Using the various aforementioned vertex function properties, the above 
expression for a social science journal citation path matrix can be simplified, 
because 



i;+(Rio.A 5T ) = v + ((C l0 ^ 5 ) T ) 
= r-(C l0 ^ 5 ) T 



Ri — Cj 

v~(Z) = 
r+(Z T ) T . 



Therefore, given the above and because A T o B T = (A o B) T , 

Z= [v~ (C l0 ^ 5 )o^ 4 ] -A 3 - [v~ {^oA 5 ) oA 4 ] T . 

The above composition reuses the computation for determining which articles 
are contained in social science journals by simply reversing the directionality 
of the final component. The two bracketed components represent contains 
such that the domain is social science journals and the range is articles. The 
ability to algebraically manipulate path expressions is one of the primary 
benefits of utilizing an algebraic structure to map a multi-relational network 
to a single-relational network. 



14 



4-3. The Weight Operation 

Composed paths can be weighted using ordinary matrix scalar multipli- 
cation. Given a scalar value of A G 1, AZ will weight all the paths in Z by 
A. This operation is useful when merging path matrices in such a way as to 
make one path matrix more or less significant than another path matrix. The 
next subsection, §4.4, discusses the operation of merging two path matrices 
and presents an example that includes the weight operation. 

4-4- The Merge Operation 

Ordinary matrix addition can be used to merge two path matrices. For 
example, consider the multi-relational tensor A, where A 1 denotes authored 
and A 6 denotes developed, authored maps humans to articles, and developed 
maps humans to programs. Furthermore, consider a definition of collabora- 
tion that includes both the coauthorship of articles and the co-development 
of software, where article coauthorship is weighted as being slightly more 
important than co-development. The path matrix 



merges the article and software program collaboration path matrices as spec- 
ified by their respective weights of 0.6 and 0.4. The resultant path matrix 
denotes article and software program collaboration. Finally, using the prop- 
erties and propositions of the path algebra, a simplification of the previous 
composition is 



5. Network Analysis Applications 

The previous sections presented various elements and operations of the 
path algebra that can be applied to a multi-relational tensor in {0, l| rax?ixrn 
in order to derive a "semantically-rich" single-relational path matrix in IR" xn . 
The resultant path matrix yields the number of paths from vertex % to vertex 
j as determined by the operations performed. The path matrix can be con- 
sidered a weighted single-relational network of the form G = (V, E, w), where 
w : E — > R + . Many single-relational network analysis algorithms require ei- 
ther a {0, l} nxri matrix or a [0, l]™ xri weighted or stochastic matrix. The 




coauthorship 



co-development 
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resultant path matrix can be manipulated in various ways (e.g. normalized 
out going weight distribution) to yield a matrix that can be appropriately 
used with the known single-relational network analysis algorithms. This sec- 
tion discusses the connection of a path matrix to a few of the more popular 
single-relational network analysis algorithms. However, before discussing the 
single-relational network analysis algorithms, the next subsection discusses 
the relationship between the path algebra and multi-relational graph query 
languages. 

5.1. Relationship to Multi- Relational Graph Query Languages 

The presented path algebra has many similarities to graph query lan- 
guages such as GraphLog [26], Path Query Language (PQL) [27], and SPARQL 
Protocol and RDF Query Language (SPARQL) [28]. All of these languages 
serve a similar function of querying a graph for data, though they have dif- 
ferent levels of expressivity (e.g. some have mechanisms to include extension 
functions in the query, to perform regular expressions on the vertex names, 
and the ability to perform recursion). However, what these languages have 
in common is the ability to perform graph pattern matching. Graph pattern 
matching is explained using an example. Suppose the multi-relational net- 
work representation H C (V x f2 x V), where V is the set of vertices and 
is the set of edges labels. Given 

Z = {lx | (?x, authoredBy, marko) G H}, 

Z is the set of all articles authored by Marko. In short, ?-variables are used 
to bind to particular vertices and must hold for the duration of the query. 
This is made more salient in the following, more complicated example: 

Z ={ly | (?rr, authoredBy, marko) G H 
A (?x, cites, ly) G H 
A (JOI, contains, ly) G H 

Mx ^?y}. 

In this example, the set Z is the set of all articles that are 

1. contained in the Journal of Informetrics (JOI), 

2. cited by Marko's authored articles, and 

3. are not articles authored by Marko. 
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In SPARQL, this is represented as 



SELECT ?y WHERE { 

?x authoredBy marko . 
?x cites ?y . 
JOI contains ?y . 
FILTER (?x != ?y) }. 

In the presented path algebra, this same query is represented as 

/ \ 



V 



A 3 o n I v- ( C 2 o A lT 



4T 



marko's articles 



citations to non-marko articles 



C 3 oA 



contained in joi J 



where vertex 2 is Marko and vertex 3 is the Journal of Informetrics. The 
resulting Z path matrix is a {0,l}-matrix, where the row vertices that have a 
1 as an entry are equivalent to those vertices that bound to ?y in the related 
SPARQL query. While the algebraic form may be considered more cumber- 
some than the SPARQL form, the benefit of using the path algebra is that 
the query can be simplified through algebraic manipulations, and that it has 
a convenient implementation using existing linear algebra toolkits. Moreover, 
a hybrid approach can be utilized that leverages the syntactic convenience of 
the standard graph query languages and the algebraic properties of the path 
algebra. That is, as many of the queries used in graph query languages can 
be specified in the path algebra, it is possible to optimize such queries in the 
algebra and then convert them back to the query language for execution. 

Finally, the path algebra provides more information than a binding of 
variables to vertices. For instance, in the previous example, without the clip 
function, the resultant Z would return how many paths exist from Marko's 
articles to those non-Marko cited articles that are contained in the Journal 
of Informetrics. In this way, some Journal of Informetrics articles may be 
deemed more "appropriate" to the query (as there may be more paths to 
them). The result path matrix can be seen as a weighted single-relational 
network that can be manipulated further by single-relational network analysis 
algorithms or used in a larger path expression. 

5.2. Shortest Path Calculation 

The family of geodesic algorithms are based on the calculation of the 
shortest path between vertices [1]. Example shortest path metrics include: 
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• eccentricity: defined for a vertex as the longest shortest path to all 
other vertices [6] , 

• radius: defined for the network as the smallest eccentricity value for all 
vertices, 

• diameter: defined for the network as the largest eccentricity value for 
all vertices, 

• closeness: defined for a vertex as the mean shortest path of a vertex 
to all other vertices [3], 

• betweenness: defined for a vertex as the number of shortest paths that 
a vertex is a part of [5]. 

A straightforward (though computationally expensive) way to calculate 
the shortest path between any two vertices i and j in an adjacency matrix 
A is to continue to raise the adjacency matrix by a power until Afj > 0. 

The first t where A-*] > denotes the length of the shortest path between 
vertex i and j as t. This same principle holds for calculating the shortest 
path in a path matrix Z and thus, the resulting path matrix generated from 
a path composition can be used to determine "semantically-rich" shortest 
paths between vertices. 

5.3. Diffusing an Energy Vector through A 

Many network analysis algorithms can be represented as an energy diffu- 
sion, where 

/ : Wf x Wf xn -> Wf 

maps a row- vector of size n (i.e. "energy vector") and a matrix of size n x 
n to a resultant energy vector of size n. Algorithms of this form include 
eigenvector centrality [7], PageRank [8], and spreading activation [29, 30] to 
name but a few. 

With respect to eigenvector centrality and PageRank, the general form 
of the algorithm can be represented as a problem in finding the primary 
eigenvector of an adjacency matrix such that 7rA = Xn, where 7r 6 I n and 
7r is the energy vector being diffused. Both algorithms can be solved using 
the "power method." In one form of the power method, the solution is found 
by iteratively multiplying 7r by A until n converges to a stable set of values 
as defined by ||7r(* _1 ) — 7r^|| 2 < e for some small e G R + . In another form, 
the problem is framed as finding which t-power of A will yield a n such that 



18 



ttA.® = \ir. With respect to spreading activation, the same power method 
can be used; however, the purpose of the algorithm is not to find the primary 
eigenvector, but instead to propagate an energy vector some finite number 
of steps. Moreover, the total energy flow through each vertex at the end of 
a spreading activation algorithm is usually what is determined. 

5.3.1. The PageRank Path Matrix 

The PageRank algorithm was developed to determine the centrality of 
web pages in a web citation network [8] and since, has been used as a general 
network centrality algorithm for various other types of networks including 
bibliographic [31, 32], social [33], and word networks [34]. The web citation 
network can be defined as G = (V, E), where V is a set of web pages and 
E C (V x V) is the set of directed citations between web pages (i.e. href). 
The interesting aspect of the PageRank algorithm is that it "distorts" the 
original network G by overlaying a "teleportation" network in which every 
vertex is connected to every other vertex by some weight defined by 5 G (0, 1]. 
The inclusion of the teleportation network ensures that the resulting hybrid 
network is strongly connected 7 and thus, the resultant primary eigenvector 
of the network is a positive real-valued vector. 



In the matrix form of PageRank, there exist two adjacency matrices in 
[0, l] nxn denoted 



where T(i) is the out degree of vertex %. V 1 is a row-stochastic adjacency 
matrix and V 2 is a fully connected adjacency matrix known as the teleporta- 
tion matrix. The purpose of PageRank is to identify the primary eigenvector 
of a merged, weighted path matrix of the form 



Z is guaranteed to be a strongly connected single-relational path matrix 



7 Strongly connected means that there exists a path from every vertex to every other 
vertex. In the language of Markov chains, the network is irreducible and recurrent. 




and 




1 



\vy 



Z = 5V 1 + (1- 5)V 2 . 
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because there is some probability (defined by 1 — 8) that every vertex is 
reachable by every other vertex. 

5.3.2. Constrained Spreading Activation 

The concept of spreading activation was made popular by cognitive sci- 
entists and the connectionist approach to artificial intelligence [29, 35, 36], 
where the application scenario involves diffusing an energy vector through an 
artificial neural network in order to simulate the neural process of a spreading 
activation potential in the brain. Spreading activation can be computed in a 
manner analogous to the power method of determining the primary eigenvec- 
tor of an adjacency matrix. However, spreading activation does not attempt 
to find a stationary energy distribution in the network and moreover, usually 
includes a decay function or activation/threshold function that can yield a 
resultant energy vector whose sum is different than the initial energy vector. 

The neural network models of connectionism deal with weighted, single- 
relational networks. Spreading activation was later generalized to support 
multi-relational networks [30]. While there are many forms of spreading 
activation on single-relational networks and likewise, many forms on multi- 
relational networks, in general, a spreading activation algorithm on a multi- 
relational network is called constrained spreading activation as not all edges 
to and from a vertex are respected equally. Constrained spreading activation 
has been used for information retrieval on the web [30, 37, 38], semantic 
vertex ranking [18, 39], and collaborative filtering [40]. The path algebra 
provides a means by which to separate the spreading activation algorithm 
from the data structure being computed on. That is, the constrained aspect 
of the algorithm is defined by the path algebra and the spreading activation 
aspect of the algorithm is defined by standard, single-relational spreading 
activation. 

5.4- Mixing Patterns 

Given a network and a scalar or categorical property value for each vertex 
in the network, it is possible to determine whether the network is assorta- 
tive or disassortative with respect to that property [41]. Example scalar 
properties for journal vertices in a scholarly network include impact factor 
ranking, years in press, cost for subscription, etc. Example categorical prop- 
erties include subject category, editor in chief, publisher, etc. A network is 
assortative with respect to a particular vertex property when vertices of like 
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property values tend to be connected to one another. Colloquially, assorta- 
tivity can be understood by the phrase "birds of a feather flock together," 
where the "feather" is the property binding individuals [42]. On the other 
hand, a network is disassortative with respect to a particular vertex property 
when vertices of unlike property values tend to be connected to one another. 
Colloquially, disassortativity can be understood by the phrase "opposites 
attract." 

There are two primary components needed when calculating the assor- 
tativity of a network. The first component is the network and the second 
component is a set of property values for each vertex in the network. A pop- 
ular publication defining the assortative mixing for scalar properties uses the 
parametric Pearson correlation of two vectors [43]. 8 One vector is the scalar 
value of the vertex property for the vertices on the tail of all edges. The 
other vector is the scalar value of the vertex property for the vertices on the 
head of all the edges. Thus, the length of both vectors is \E\ (i.e. the total 
number of edges in the single- relational network). Formally, the correlation 
is defined as 

= \ E \ YnJiki - Y,j3i Y.i k i 

\I[\e\ EJ? ~ (EiJi) 2 ] [\e\ Y~k 2 i - (Eihf] ' 

where ji is the scalar value of the vertex on the tail of edge i, and ki is the 
scalar value of the vertex on the head of edge i. The correlation coefficient r 
is in [—1,1], where —1 represents a fully disassortative network, represents 
an uncorrelated network, and 1 represents a fully assortative network. On 
the other hand, for categorical properties, the equation 

yields a value in [—1,1] as well, where e aa is the number of edges in the 
network that have property value a on both ends, i a is the number of edges 
in the network that have property value a on their tail vertex, and j a is the 
number of edges that have property value a on their head vertex [41]. 



8 Note that scalar value distributions may not be normally distributed and thus, in 
such cases, a non-parametric correlation such as the Spearman p may be the more useful 
correlation coefficient. 
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Given a path matrix, assortativity is calculated on paths, not on edges, 
where there may be many paths (possibly weighted) between any two ver- 
tices. 9 Thus, a weighted correlation is required [44]. Let ji denote the scalar 
property value of the vertex on the tail of path i and ki denote the scalar 
property value of the vertex on the head of path i. The previous scalar as- 
sortativity equation can be generalized such that if z; t is the fraction of path 
weight in Z for path i, then 



r = 



cov ijfc 



^cov^cov^. 



where 



cov ijfc = 



Ei* 



Ji 



Ej z ih 



Similarly, for categorical vertex properties, 

_ Eg e aa ~ Eg Wa 
1 ~~ Eg iaja 

where e a g is the total path weight of paths that have tail and head vertices 
with a property value of a, i a is the total path weight of all paths that have 
a tail vertex with a property value of a, and j a is the total path weight of all 
paths that have a head vertex with a property value of a. 



6. Conclusion 

The number of algorithms and toolkits for single-relational networks far 
exceeds those currently available for multi-relational networks. However, 
with the rising popularity of the multi-relational network, as made evident by 
the Web of Data initiative and the multi- relational RDF data structure [45] , 
there is a need for methods that port the known single-relational network 
analysis algorithms over to these multi-relational domains. A convenient 



It is important to note that with multi-relational networks, vertex property values 
can be encoded in the network itself. For instance, given the scholarly network example 
of previous, the subject category "social science" is a vertex adjacent to a set of journal 
vertices according to the relation category : J — > S. More generally, vertex property 
values may be determined through path composition. 
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method to do so is a path algebra. Path algebras have been used extensively 
for analyzing paths in single-relational networks [46, 47] and the applica- 
tion to multi-relational networks can prove useful. The multi-relational path 
algebra presented in this article operates on an n x n x m tensor represen- 
tation of a multi-relational network. By means of a series of operations on 
two-way "slices" of this tensor, a "semantically-rich" n x n single-relational 
path matrix can be derived. The resulting path matrix represents a single- 
relational network. This single-relational network may then be subjected to 
any of the known single-relational network analysis algorithms. Thus, the 
presented path algebra can be used to expose multi-relational networks to 
single-relational network analysis algorithms. 
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